An Exact Algorithm for F-Measure Maximization

نویسندگان

  • Krzysztof Dembczynski
  • Willem Waegeman
  • Weiwei Cheng
  • Eyke Hüllermeier
چکیده

The F-measure, originally introduced in information retrieval, is nowadays routinely used as a performance metric for problems such as binary classification, multi-label classification, and structured output prediction. Optimizing this measure remains a statistically and computationally challenging problem, since no closed-form maximizer exists. Current algorithms are approximate and typically rely on additional assumptions regarding the statistical distribution of the binary response variables. In this paper, we present an algorithm which is not only computationally efficient but also exact, regardless of the underlying distribution. The algorithm requires only a quadratic number of parameters of the joint distribution (with respect to the number of binary responses). We illustrate its practical performance by means of experimental results for multi-label classification.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

F-Measure Maximization in Multi-Label Classification with Conditionally Independent Label Subsets

We discuss a method to improve the exact F-measure maximization algorithm called GFM, proposed in [2] for multi-label classification, assuming the label set can be partitioned into conditionally independent subsets given the input features. If the labels were all independent, the estimation of only m parameters (m denoting the number of labels) would suffice to derive Bayes-optimal predictions ...

متن کامل

SPECT Reconstruction Using the Expectation Maximization Algorithm and an Exact Inversion Formula

We develop the theory behind the Expectation Maximization algorithm and an exact inversion formula for the attenuated Radon transform, two reconstruction methods used in SPECT. We also implement both methods and present a few numerical experiments.

متن کامل

Stochastic Optimization: a Review

We review three leading stochastic optimization methods—simulated annealing, genetic algorithms, and tabu search. In each case we analyze the method, give the exact algorithm, detail advantages and disadvantages, and summarize the literature on optimal values of the inputs. As a motivating example we describe the solution—using Bayesian decision theory, via maximization of expected utility—of a...

متن کامل

Maximum Likelihood Parameter Estimation

The problem of estimating the parameters for continuous-time partially observed systems is discussed. New exact lters for obtaining Maximum Likelihood (ML) parameter estimates via the Expectation Maximization algorithm are derived. The methodology exploits relations between incomplete and complete data likelihood and gradient of likelihood functions, which are derived using Girsanov's measure t...

متن کامل

Steepest descent on factor graphs

x f(x, θ) log f(x, θ) exists for all θ and θ. In principle, one can apply the sum-product algorithm in order to find (1), which involves the following two steps [2]: 1. Determine f(θ) by sum-product message passing. 2. Maximization step: compute θmax △ = argmaxθ f(θ). This procedure is often not feasible, since • When the variable x is continuous, the sum-product rule may lead to intractable in...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011